Skip to main content

Download data and software

How to start

We suggest starting with the installation of the software and then proceeding to follow the instructions for downloading and extracting the data.

A. Software

All the necessary software to run the recipes and make your own submission is available on our Clarity-Cadenza GitHub repository.

The official code for the Second Cadenza Challenge was released in version v0.6.0. To avoid any conflict, we highly recommend for you to work using this version and not with the code from the main branch. To install this version you have three options

1. Download the files of the release v0.6.0

From inside the directory, run:

pip install -e .
2. Clone the repository and checkout version v0.6.0
git clone https://github.com/claritychallenge/clarity.git
git checkout tags/v0.6.0

cd clarity
pip install -e .
3. Install pyclarity version 0.6.0 from PyPI
pip install pyclarity==0.6.0

B. Data

B.1 Download the data for Task 1

The data is available in one packages, please complete this form to request access.

  • cadenza_cad2_task1_train.v1_0.tar.gz [15 GB] - audio data and metadata.

Unpack the packages

After downloading the packages Unpack the packages under the same root directory as:

tar -xvzf <PACKAGE_NAME>

B.2 Download the data for Task 2

The data is available in several packages, please complete this form to request access.

  • CadenzaWoodwind.zip [7.5 GB] - CadenzaWoodwind audio dataset.
  • EnsembleSet_Mix_1.zip [5.2 GB] - Small subset of EnsembleSet.
  • metadat.zip [209 kB] - metadata for the system.
  • Stereo_Reverb_Real_Data_For_Tuning.zip [182 MB] - Small subset of real recordings for tuning.

Unpack the packages

After downloading the packages, save them in the same directory and run the script process_zenodo_download.py found with the baseline on GitHub.